Overview

Dataset Statistics

Number of Variables 13
Number of Rows 506
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 51.5 KB
Average Row Size in Memory 104.3 B
Variable Types
  • Numerical: 11
  • Categorical: 2

Dataset Insights

CRIM is skewed Skewed
ZN is skewed Skewed
INDUS is skewed Skewed
AGE is skewed Skewed
TAX is skewed Skewed
PTRATIO is skewed Skewed
B is skewed Skewed
CHAS has constant length 3 Constant Length
ZN has 372 (73.52%) zeros Zeros

Variables


CRIM

numerical

Approximate Distinct Count 504
Approximate Unique (%) 99.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 3.6135
Minimum 0.00632
Maximum 88.9762
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • CRIM is skewed right (γ1 = 5.2077)

Quantile Statistics

Minimum 0.00632
5-th Percentile 0.02791
Q1 0.08204
Median 0.2565
Q3 3.6771
95-th Percentile 15.7891
Maximum 88.9762
Range 88.9699
IQR 3.595

Descriptive Statistics

Mean 3.6135
Standard Deviation 8.6015
Variance 73.9866
Sum 1828.4429
Skewness 5.2077
Kurtosis 36.7528
Coefficient of Variation 2.3804
  • CRIM is not normally distributed (p-value 1.00191271223882e-24)
  • CRIM has 66 outliers

ZN

numerical

Approximate Distinct Count 26
Approximate Unique (%) 5.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 11.3636
Minimum 0
Maximum 100
Zeros 372
Zeros (%) 73.5%
Negatives 0
Negatives (%) 0.0%
  • ZN is skewed right (γ1 = 2.2191)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 12.5
95-th Percentile 80
Maximum 100
Range 100
IQR 12.5

Descriptive Statistics

Mean 11.3636
Standard Deviation 23.3225
Variance 543.9368
Sum 5750
Skewness 2.2191
Kurtosis 3.9799
Coefficient of Variation 2.0524
  • ZN is not normally distributed (p-value 6.514643020988688e-25)
  • ZN has 68 outliers

INDUS

numerical

Approximate Distinct Count 76
Approximate Unique (%) 15.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 11.1368
Minimum 0.46
Maximum 27.74
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • INDUS is skewed right (γ1 = 0.2941)

Quantile Statistics

Minimum 0.46
5-th Percentile 2.18
Q1 5.19
Median 9.69
Q3 18.1
95-th Percentile 21.89
Maximum 27.74
Range 27.28
IQR 12.91

Descriptive Statistics

Mean 11.1368
Standard Deviation 6.8604
Variance 47.0644
Sum 5635.21
Skewness 0.2941
Kurtosis -1.2332
Coefficient of Variation 0.616
  • INDUS is not normally distributed (p-value 4.006154866633497e-19)

CHAS

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Memory Size 34408
  • The largest value (0.0) is over 13.46 times larger than the second largest value (1.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 0.0
2nd row 0.0
3rd row 0.0
4th row 0.0
5th row 0.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1012
  • The top 2 categories (0.0, 1.0) take over 50.0%
  • The largest value (00) is over 13.46 times larger than the second largest value (10)
  • CHAS has words of constant length

NOX

numerical

Approximate Distinct Count 81
Approximate Unique (%) 16.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 0.5547
Minimum 0.385
Maximum 0.871
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • NOX is skewed right (γ1 = 0.7271)

Quantile Statistics

Minimum 0.385
5-th Percentile 0.4093
Q1 0.449
Median 0.538
Q3 0.624
95-th Percentile 0.74
Maximum 0.871
Range 0.486
IQR 0.175

Descriptive Statistics

Mean 0.5547
Standard Deviation 0.1159
Variance 0.01343
Sum 280.6757
Skewness 0.7271
Kurtosis -0.07586
Coefficient of Variation 0.2089

RM

numerical

Approximate Distinct Count 446
Approximate Unique (%) 88.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 6.2846
Minimum 3.561
Maximum 8.78
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • RM is skewed right (γ1 = 0.4024)

Quantile Statistics

Minimum 3.561
5-th Percentile 5.314
Q1 5.8855
Median 6.2085
Q3 6.6235
95-th Percentile 7.5875
Maximum 8.78
Range 5.219
IQR 0.738

Descriptive Statistics

Mean 6.2846
Standard Deviation 0.7026
Variance 0.4937
Sum 3180.025
Skewness 0.4024
Kurtosis 1.861
Coefficient of Variation 0.1118
  • RM is not normally distributed (p-value 0.00017026524836535143)
  • RM has 30 outliers

AGE

numerical

Approximate Distinct Count 356
Approximate Unique (%) 70.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 68.5749
Minimum 2.9
Maximum 100
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • AGE is skewed left (γ1 = -0.5972)

Quantile Statistics

Minimum 2.9
5-th Percentile 17.725
Q1 45.025
Median 77.5
Q3 94.075
95-th Percentile 100
Maximum 100
Range 97.1
IQR 49.05

Descriptive Statistics

Mean 68.5749
Standard Deviation 28.1489
Variance 792.3584
Sum 34698.9
Skewness -0.5972
Kurtosis -0.97
Coefficient of Variation 0.4105
  • AGE is not normally distributed (p-value 2.7780696281687133e-15)

DIS

numerical

Approximate Distinct Count 412
Approximate Unique (%) 81.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 3.795
Minimum 1.1296
Maximum 12.1265
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • DIS is skewed right (γ1 = 1.0088)

Quantile Statistics

Minimum 1.1296
5-th Percentile 1.462
Q1 2.1002
Median 3.2074
Q3 5.1884
95-th Percentile 7.8278
Maximum 12.1265
Range 10.9969
IQR 3.0883

Descriptive Statistics

Mean 3.795
Standard Deviation 2.1057
Variance 4.434
Sum 1920.2916
Skewness 1.0088
Kurtosis 0.4713
Coefficient of Variation 0.5549
  • DIS is not normally distributed (p-value 0.0019445276361671796)
  • DIS has 5 outliers

RAD

categorical

Approximate Distinct Count 9
Approximate Unique (%) 1.8%
Missing 0
Missing (%) 0.0%
Memory Size 34540

Length

Mean 3.2609
Standard Deviation 0.4395
Median 3
Minimum 3
Maximum 4

Sample

1st row 1.0
2nd row 2.0
3rd row 2.0
4th row 3.0
5th row 3.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1144

TAX

numerical

Approximate Distinct Count 66
Approximate Unique (%) 13.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 408.2372
Minimum 187
Maximum 711
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • TAX is skewed right (γ1 = 0.668)

Quantile Statistics

Minimum 187
5-th Percentile 222
Q1 279
Median 330
Q3 666
95-th Percentile 666
Maximum 711
Range 524
IQR 387

Descriptive Statistics

Mean 408.2372
Standard Deviation 168.5371
Variance 28404.7595
Sum 206568
Skewness 0.668
Kurtosis -1.143
Coefficient of Variation 0.4128
  • TAX is not normally distributed (p-value 6.761764732247678e-17)

PTRATIO

numerical

Approximate Distinct Count 46
Approximate Unique (%) 9.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 18.4555
Minimum 12.6
Maximum 22
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • PTRATIO is skewed left (γ1 = -0.7999)

Quantile Statistics

Minimum 12.6
5-th Percentile 14.7
Q1 17.4
Median 19.05
Q3 20.2
95-th Percentile 21
Maximum 22
Range 9.4
IQR 2.8

Descriptive Statistics

Mean 18.4555
Standard Deviation 2.1649
Variance 4.687
Sum 9338.5
Skewness -0.7999
Kurtosis -0.2941
Coefficient of Variation 0.1173
  • PTRATIO is not normally distributed (p-value 4.6331955585016225e-20)
  • PTRATIO has 15 outliers

B

numerical

Approximate Distinct Count 357
Approximate Unique (%) 70.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 356.674
Minimum 0.32
Maximum 396.9
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • B is skewed left (γ1 = -2.8818)

Quantile Statistics

Minimum 0.32
5-th Percentile 84.59
Q1 375.3775
Median 391.44
Q3 396.225
95-th Percentile 396.9
Maximum 396.9
Range 396.58
IQR 20.8475

Descriptive Statistics

Mean 356.674
Standard Deviation 91.2949
Variance 8334.7523
Sum 180477.06
Skewness -2.8818
Kurtosis 7.1438
Coefficient of Variation 0.256
  • B is not normally distributed (p-value 6.053682405291396e-24)
  • B has 77 outliers

LSTAT

numerical

Approximate Distinct Count 455
Approximate Unique (%) 89.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 8096
Mean 12.6531
Minimum 1.73
Maximum 37.97
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • LSTAT is skewed right (γ1 = 0.9038)

Quantile Statistics

Minimum 1.73
5-th Percentile 3.7075
Q1 6.95
Median 11.36
Q3 16.955
95-th Percentile 26.8075
Maximum 37.97
Range 36.24
IQR 10.005

Descriptive Statistics

Mean 12.6531
Standard Deviation 7.1411
Variance 50.9948
Sum 6402.45
Skewness 0.9038
Kurtosis 0.4765
Coefficient of Variation 0.5644
  • LSTAT has 7 outliers

Interactions

Correlations

Missing Values